Incremental Data Quality in the Data Warehouse
Abstract
The data warehouse is the cornerstone of business knowledge production in an organization, and the quality of that knowledge rests on the quality of the data in the warehouse. The dimensions of data quality in the data warehouse have been determined through intuitive, empirical and ontological approaches. The first point of this working paper is that data quality is not a static measure and that awareness of the data quality dimensions is a prerequisite for improving data quality. The second point is that selection is the cornerstone of data quality in the data warehouse in relation to the quality dimensions. The third point is that post-load improvement of data quality is attainable. Metadata containing information on the use of the data can be added incrementally, including the users' selections within
Related resources
Speeding Up Incremental View Maintenance Using the Cuckoo Algorithm
A data warehouse is a repository of integrated data collected from various sources. It can maintain data from these sources in the form of views, so the views must be maintained and updated as the sources change. Since a growing number of updates may cause costly overhead, it is necessary to update views with high accuracy. Optimal Delta Evaluation method is...
Formalizing ETL Jobs for Incremental Loading of Data Warehouses
Extract-transform-load (ETL) tools are primarily designed for data warehouse loading, i.e. to perform physical data integration. When the operational data sources happen to change, the data warehouse gets stale. To ensure data timeliness, the data warehouse is refreshed on a periodical basis. The naive approach of simply reloading the data warehouse is obviously inefficient. Typically, only a s...
Incremental Load in a Data Warehousing Environment
Incremental load is an important factor for successful data warehousing. Lack of standardized incremental refresh methodologies can lead to poor analytical results, which can be unacceptable to an organization’s analytical community. Successful data warehouse implementation depends on consistent metadata as well as incremental data load techniques. If consistent load timestamps are maintained a...
Design of Incremental Data Mining Algorithm [IMA] and its Performance Evaluation over Cumbersome Warehouse
In the current world of large data warehouses, there is a need to develop an incremental data mining algorithm that can mine data from a large warehouse in less time and with less memory. The current study emphasizes developing the IMA incremental data mining algorithm and evaluating its performance against an existing incremental data mining algorithm [DELTA] using MATLAB and SPSS 16....
Incremental Maintenance of Object-Oriented Views in a Warehousing Environment
Data warehousing is an approach to data integration in which integrated information is stored in a data warehouse for direct querying and analysis. To provide fast access, a data warehouse stores materialized views defined over data from its data sources. As a result, a data warehouse needs to be maintained to keep its contents consistent with the contents of its data sources. Incremental maint...